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HEPATITIS C VIRUS PROTEASE 

Cross-Refcrence to Related Applications 

This application is a continuation-in-part application of U.S. Serial No. 
07/505,433, filed on 4 April 1990. 

Technical Field 

This invention relates to the molecular biology and virology of the 
hepatitis C virus (HCV). More specifically, this invention relates to a novel protease 
produced by HCV, methods of expression, recombinant protease, protease mutants, 
and inhibitors of HCV protease. 

Background of the Invention 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease (or family 
of diseases) that is believed to be virally induced, and is distinguishable from other 
forms of virus-associated liver disease, such as those caused by hepatitis A virus 
(HAV), hepatitis B virus (HBV), delta hepatitis virus (HDV), cytomegalovirus (CMV) 
or Epstein-Barr virus (EBV). Epidemiologic evidence suggests that there may be three 
types of NANBH: the water-borne epidemic type; the blood or needle associated 
type; and the sporadically occurring (community acquired) type. However, the 
number of causative agents is unknown. Recently, however, a new viral species, 
hepatitis C virus (HCV) has been identified as the primary (if not only) cause of 
blood-associated NANBH (BB-NANBH). See for example, PCT WO89/046699; U.S. 
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Patent Application Serial No. 7/456,637, filed 21 December 1989; and U.S. Patent 
Application Serial No. 7/456,637, filed 21 December 1989, incorporated herein by 
reference. Hepatitis C appears to be the major form of transfusion-associated hepatitis 
in a number of countries, including the United States and Japan. There is also 

5 evidence implicating HCV in induction of hepatocellular carcinoma. Thus, a need 
exists for an effective method for treating HCV infection: currently, there is none. 

Many viruses, including adenoviruses, baculoviruses, comoviruses, pico- 
rnaviruses, retroviruses, and togaviruses, rely on specific, virally-encoded proteases for 
processing polypeptides from their initial translated form into mature, active proteins. 

10 In the case of picornaviruses, all of the viral proteins are believed to arise from 

cleavage of a single polyprotein (B.D. Korant, CRC Crit Rev Biotech (1988) 8:149- 
57). 

S. Pichuantes et al, in "Viral Proteinases As Targets For Chemotherapy 11 
(Cold Spring Harbor Laboratory Press, 1989) pp. 215-22, disclosed expression of a 

15 viral protease found in HIV-1. The HIV protease was obtained in the form of a fusion 
protein, by fusing DNA encoding an HIV protease precursor to DNA encoding human 
superoxide dismutase (hSOD), and expressing the product in E. coli. Transformed 
cells expressed products of 36 and 10 kDa (corresponding to the hSOD-protease fusion 
protein and the protease alone), suggesting that the protease was expressed in a form 

20 capable of autocatalytic proteolysis. 

TJ. McQuade et al, Science (1990) 247:454-56 disclosed preparation of 
a peptide mimic capable of specifically inhibiting the HIV-1 protease. In HIV, the 
protease is believed responsible for cleavage of the initial p55 gag precursor transcript 
into the core structural proteins (pl7, p24, p8, and p7). Adding 1 pM inhibitor to 

25 HIV-infected peripheral blood lymphocytes in culture reduced the concentration of 
processed HIV p24 by about 70%. Viral maturation and levels of infectious virus 
were reduced by the protease inhibitor. 
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Disclosure of the Invention 

We have now invented recombinant HCV protease, HCV protease 
fusion proteins, truncated and altered HCV proteases, cloning and expression vectors 
therefore, and methods for identifying antiviral agents effective for treating HCV. 

Brief Description of the Drawings 

Figure 1 shows the sequence of HCV protease. 

Figure 2 shows the polynucleotide sequence and deduced amino acid 

sequence of the clone C20c. 

Figure 3 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C26d. 

Figure 4 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C8h. * 

Figure 5 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C7f . 

Figure 6 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C31. 

Figure 7 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C35. 

Figure 8 shows the polynucleotide sequence and deduced amino acid 
sequence of the clone C33c. 

Figure 9 schematically illustrates assembly of the vector 
C7fC20cC300C200. 

Figure 10 shows the sequence of vector cf lSODp600. 
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Modes of Carrying Out The Invention 
A. Definitions 

The terms 'Hepatitis C Virus" and ,f HCV H refer to the viral species that 
is the major etiological agent of BB-NANBH, the prototype isolate of which is 
identified in PCT WO89/046699; EPO publication 318,216; USSN 7/355,008, filed 
18 May 1989; and USSN 7/456,637, the disclosures of which are incorporated herein 
by reference. ,f HCV" as used herein includes the pathogenic strains capable of causing 
hepatitis C, and attenuated strains or defective interfering particles derived therefrom. 
The HCV genome is comprised of RNA. It is known that RNA-containing viruses 

-3 

have relatively high rates of spontaneous mutation, reportedly on the order of 10 to 
-4 

10 per incorporated nucleotide (Fields & Knipe, 'Fundamental Virology" (1986, 
Raven Press, N.Y.)). As heterogeneity and fluidity of genotype are inherent character- 
istics of RNA viruses, there will be multiple strains/isolates, which may be virulent or 
avirulent, within the HCV species. 

Information on several different strains/isolates of HCV is disclosed 
herein, particularly strain or isolate CDC/HCVI (also called HCV1), Information from 
one strain or isolate, such as a partial genomic sequence, is sufficient to allow those 
skilled in the art using standard techniques to isolate new strains/isolates and to 
identify whether such new strains/isolates are HCV. For example, several different 
strains/isolates are described below. These strains, which were obtained from a 
number of human sera (and from different geographical areas), were isolated utilizing 
the information from the genomic sequence of HCV1. 

The information provided herein suggests that HCV may be distantly 
related to the flaviviridae. The Flavivirus family contains a large number of viruses 
which are small, enveloped pathogens of man. The morphology and composition of 
Flavivirus particles are known, and are discussed in M.A. Brinton, in 'The Viruses: 
The Togaviridae And Flaviviridae 1 1 (Series eds. Fraenkel-Conrat and Wagner, vol. eds. 
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Schlesinger and Schlesinger, Plenum Press, 1986), pp. 327-374. Generally, with 
respect to morphology, Flaviviruses contain a central nucleocapsid surrounded by a 
lipid Mayer. Virions are spherical and have a diameter of about 40-50 nm. Their 
cores are about 25-30 nm in diameter. Along the outer surface of the virion envelope 
are projections measuring about 5-10 nm in length with terminal knobs about 2 nm in 
diameter. Typical examples of the family include Yellow Fever virus, West Nile 
virus, and Dengue Fever virus. They possess positive-stranded RNA genomes (about 
11,000 nucleotides) that are slightly larger than that of HCV and encode a poly protein 
precursor of about 3500 amino acids. Individual viral proteins are cleaved from this 
precursor polypeptide. 

The genome of HCV appears to be single-stranded RNA containing 
about 10,000 nucleotides. The genome is positive-stranded, and possesses a contin- 
uous translational open reading frame (ORF) that encodes a polyprotein of about 3,000 
amino acids. In the ORF, the structural proteins appear to be encoded in ap- 
proximately the first quarter of the N-terminal region, with the majority of the 
polyprotein attributed to non-structural proteins. When compared with all known viral 
sequences, small but significant co-linear homologies are observed with the non- 
structural proteins of the Flavivirus family, and with the pestiviruses (which are now 
also considered to be part of the Flavivirus family). 

A schematic alignment of possible regions of a flaviviral polyprotein 
(using Yellow Fever Virus as an example), and of a putative polyprotein encoded in 
the major ORF of the HCV genome, is shown in Figure 1. Possible domains of the 
HCV polyprotein are indicated in the figure. The Yellow Fever Virus polyprotein 
contains, from the amino terminus to the carboxy terminus, the nucleocapsid protein 
(C), the matrix protein (M), the envelope protein (E), and the non-structural proteins 1, 
2 (a+b), 3, 4 (a+b), and 5 (NS1, NS2, NS3, NS4, and NS5). Based upon the putative 
amino acids encoded in the nucleotide sequence of HCV1, a small domain at the 
extreme N-terminus of the HCV polyprotein appears similar both in size and high con- 
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tent of basic residues to the nucleocapsid protein (C) fouhd at the N-terminus of flavi- 
viral polyproteins. The non-structural proteins 2,3,4, and 5 (NS2-5) of HCV and of 
yellow fever virus (YFV) appear to have counterparts of similar size and hydropath- 
icity, although the amino acid sequences diverge. However, the region of HCV which 
would correspond to the regions of YFV polyprotein which contains the M, E, and 
NS1 protein not only differs in sequence, but also appears to be quite different in size 
and hydropathicity. Thus, while certain domains of the HCV genome may be referred 
to herein as, for example, NS1, or NS2, it should be understood that these designations 
are for convenience of reference only; there may be considerable differences between 
the HCV family and flaviviruses that have yet to be appreciated 

Due to the evolutionary relationship of the 1 strains or isolates of HCV, 
putative HCV strains and isolates are identifiable by their homology at the polypeptide 
level With respect to the isolates disclosed herein, new iHCV strains or isolates are 
expected to be at least about 40% homologous, some more than about 70% homol- 
ogous, and some even more than about 80% homologous: some may be more than 
about 90% homologous at the polypeptide leveL The techniques for determining 
amino acid sequence homology are known in the art. For example, the amino acid 
sequence may be determined directly and compared to the sequences provided herein. 
Alternatively the nucleotide sequence of the genomic material of the putative HCV 
may be determined (usually via a cDNA intermediate), the amino acid sequence 

encoded therein can be determined, and the corresponding regions compared. 

The term tr HCV protease 11 refers to an enzyme derived from HCV which 
exhibits proteolytic activity, specifically the polypeptide encoded in the NS3 domain of 
the HCV genome. At least one strain of HCV contains a protease believed to be sub- 
stantially encoded by or within the following sequence: 

Arg Arg Gly Arg Glu He Leu Leu Gly Pro 10 
Ala Asp Gly Met Val Ser Lys Gly Trp Arg 20 
Leu Leu Ala Pro He Thr Ala Tyr Ala Gin 30 
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Gin Thr Arg Gly Leu Leu Gly Cys lie lie 
Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin 
Val Glu Gly Glu Val Gin lie Val Ser Thr 
Ala Ala Gin Thr Phe Leu Ala Thr Cys He 
Asn Gly Val Cys Trp Thr Val Tyr His Gly 



40 
50 
60 
70 
80 
90 



Ala Gly Thr Arg Thr lie Ala Ser Pro Lys 



Gly Pro Val lie Gin Met Tyr Thr Asn Val 100 

Asp Gin Asp. Leu Val Gly Tip Pro Ala Ser 1 10 

Gin Gly Thr Arg Ser Leu Thr Pro Cys Thr 120 

Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 130 

Arg His Ala Asp Val lie Pro Val Arg Arg 140 

Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser 150 

Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 160 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly 170 

His Ala Val Gly lie Phe Arg Ala Ala Val 180 

Cys Thr Arg Gly Val Ala Lys Ala Val Asp 190 

Phe He Pro Val Glu Asn Leu Glu Thr Thr 200 

Met Arg ••• 202 

The above N and C termini are putative, the actual termini being 



defined by expression and processing in an appropriate host of a DNA construct 
encoding the entire NS3 domain. It is understood that this sequence may vary from 
strain to strain, as RNA viruses like HCV are known to exhibit a great deal of 
variation. Further, the actual N and C termini may vary, as the protease is cleaved 
from a precursor polyprotein: variations in the protease amino acid sequence can 
result in cleavage from the polyprotein at different points. Thus, the amino- and 
carboxy-termini may differ from strain to strain of HCV. The first amino acid shown 
above corresponds to residue 60 in Figure 1. However, the minimum sequence 
necessary for activity can be determined by routine methods. The sequence may be 
truncated at either end by treating an appropriate expression vector with an exonucle- 
ase (after cleavage at the 5' or 3' end of the coding sequence) to remove any desired 
number of base pairs. The resulting coding polynucleotide is then expressed and the 
sequence determined. In this manner the activity of the resulting product may be 
correlated with the amino acid sequence: a limited series of such experiments 



(removing progressively greater numbers of base pairs) determines the minimum 
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internal sequence necessary for protease activity. We have found that the sequence 
may be substantially truncated, particularly at the carboxy terminus, apparently with 
full retention of protease activity. It is presently believed that a portion of the protein 
at the caiboxy terminus may exhibit helicase activity. However, helicase activity is 
not required of the HCV proteases of the invention. The amino terminus may also be 
truncated to a degree without loss of protease activity. 

The amino acids underlined above are believed to be the residues nec- 
essary for catalytic activity, based on sequence homology to putative flavivirus serine 
proteases. Table 1 shows the alignment of the three serine protease catalytic residues 
for HCV protease and the protease obtained from Yellow Fever Virus, West Nile 
Fever virus, Murray Valley Fever virus, and Kunjin virus. Although the other four 
flavivirus protease sequences exhibit higher homology with each other than with HCV, 
a degree of homology is still observed with HCV. This homology, however, was not 
sufficient for indication by currently available alignment software. The indicated 
amino acids are numbered His^, Asp 103 , and Ser 161 in the sequence listed above (His 139l 
Asp 163 , and Ser 221 in Figure 1). 



TABLE 1: Alignment of Active Residues by Sequence 



Protease 



His 



Asp 



Ser 



HCV 

Yellow Fever 
West Nile Fever 
Murray Valley 
Kunjin Virus 



CWTVYHGAG 
FHTMWHVTR 
FHTLWHTTK 
FHTLWHTTR 
FHTLWHTTK 



DQDLGWPAP 

KEDLVAYGG 
KEDRLCYGG 
KEDRVTYGG 
KEDRLCYGG 



LKGSSGGPL 

PSGTSGSPI 

PTGTSGSPI 

PIGTSGSPI 

PTGTSGSPI 
~~ -1 



Alternatively, one can make catalytic residue 
assignments based on structural homology. Table 2 shows 
alignment of HCV with against the catalytic sites of several 
well-characterized serine proteases based on structural con- 
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siderations : protease A from Streptomyces griseus, a-lytic 
protease, bovine trypsin, chymo trypsin, and elastase (M. 
James et al, Can J Biochem (1978) 5£:396). Again, a degree 
of homology is observed. The HCV residues identified are 
numbered His 7g , Asp 125 , and Ser 161 in the sequence listed 
above . 



TABLE 2: Alignment of Active Residues by Structure 



Protease 


His 


Asp 


Ser 


S. griseus A 


TAGHC 


NNDYGII 


GDSGGSL 


a- Lytic protease 


TAGHC 


GNDRAWV 


GDSGGSW 


Bovine Trypsin 


SAAHC 


NNDIMLI 


GDSGGPV 


Chymo tryps in 


TAAHC 


NNDITLL 


GDSGGPL 


Elastase 


TAAHC 


GYDIALL 


GDSGGPL 


HCV 


TVYHG 


SSDLYLV 


GSSGGPL 



The most direct manner to verify the residues essential to the active site 
is to replace each residue individually with a residue of equivalent stearic size. This is 
easily accomplished by site-specific mutagenesis and similar methods known in the 
art. If replacement of a particular residue with a residue of equivalent size results in 

loss of activity, the essential nature of the replaced residue is confirmed. 

"HCV protease analogs" refer to polypeptides which vary from the full 
length protease sequence by deletion, alteration and/or addition to the amino acid 
sequence of the native protease. HCV protease analogs include the truncated proteases 
described above, as well as HCV protease muteins and fusion proteins comprising 
HCV protease, truncated protease, or protease muteins. Alterations to form HCV pro- 
tease muteins are preferably conservative amino acid substitutions, in which an amino 
acid is replaced with another naturally-occurring amino acid of similar character. For 
example, the following substitutions are considered "conservative": 

Gly <r) Ala; Asp Glu; 

Val <-» He <-» Leu; 
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Lys f-> Arg; 
Asn <-> Gin; and 
Phe <-> Trp <H> Tyr. 

Nonconservative changes are generally substitutions of one of the above amino acids 
with an amino acid from a different group (e.g., substituting Asn for Glu), or sub- 
stituting Cys, Met, His, or Pro for any of the above amino acids. Substitutions involv- 
ing common amino acids are conveniently performed by site specific mutagenesis of 
an expression vector encoding the desired protein, and subsequent expression of the 
altered form. One may also alter amino acids by synthetic or semi-synthetic methods. 
For example, one may convert cysteine or serine residues to selenocysteine by approp- 
riate chemical treatment of the isolated protein. Alternatively, one may incorporate 
uncommon amino acids in standard in vitro protein synthetic methods. Typically, the 
total number of residues changed, deleted or added to the native sequence in the 
muteins will be no more than about 20, preferably no more than about 10, and most 
preferably no more than about 5. 

The term fusion protein generally refers to a polypeptide comprising an 
amino acid sequence drawn from two or more individual proteins. In the present 
invention, "fusion protein" is used to denote a polypeptide comprising the HCV 
protease, truncate, mutein or a functional portion thereof, fused to a non-HCV protein 
or polypeptide ("fusion partner 11 ). Fusion proteins are most conveniently produced by 
expression of a fused gene, which encodes a portion of one polypeptide at the 5' end 
and a portion of a different polypeptide at the 3' end, where the different portions are 
joined in one reading frame which may be expressed in a suitable host. It is presently 
preferred (although not required) to position the HCV protease or analog at the car- 
boxy terminus of the fusion protein, and to employ a functional enzyme fragment at 
the amino terminus. As the HCV protease is normally expressed within a large 
polyprotein, it is not expected to include cell transport signals (e.g., export or secretion 
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signals). Suitable functional enzyme fragments are those polypeptides which exhibit a 
quantifiable activity when expressed fused to the HCV protease. Exemplary enzymes 
include, without limitation, ji-galactosidase ((5-gal), p-lactamase, horseradish per- 
oxidase (HRP), glucose oxidase (GO), human superoxide dismutase (hSOD), urease, 
5 and the like. These enzymes are convenient because the amount of fusion protein 

produced can be quantified by means of simple colorimetric assays. Alternatively, one 
may employ antigenic proteins or fragments, to permit simple detection and quantifica- 
tion of fusion proteins using antibodies specific for the fusion partner. The presently 

preferred fusion partner is hSOD. 

10 

B, General Method 

The practice of the present invention generally employs conventional 
techniques of molecular biology, microbiology, recombinant DNA, and immunology, 
which are within the skill of the art. Such techniques are explained fully in the 
15 literature. See for example J. Sambrook et al, 'Molecular Cloning; A Laboratory 
Manual (1989); "DNA Cloning", Vol. I and H (D.N Glover cd. 1985); "Oligo- 
nucleotide Synthesis" (M J. Gait ed, 1984); "Nucleic Acid Hybridization" (B.D. 
Hames & S J. Higgins eds. 1984); 'Transcription And Translation" (BX). Hames & 
S J. Higgins eds. 1984); "Animal Cell Culture" (R.L Freshney ed. 1986); "Immobil- 
20 ized Cells And Enzymes" (IRL Press, 1986); B, Perbal, "A Practical Guide To 

Molecular Cloning" (1984); the series, "Methods In Enzymology" (Academic Press, 
Inc.); "Gene Transfer Vectors For Mammalian Cells" (J.H. Miller and MJP, Calos eds. 
1987, Cold Spring Harbor Laboratory); Meth Enzymol (1987) 154 and 155 (Wu and 
Grossman, and Wu, eds., respectively); Mayer & Walker, eds. (1987), "Immunochem- 
25 ical Methods In Cell And Molecular Biology" (Academic Press, London); Scopes, 

"Protein Purification: Principles And Practice", 2nd Ed (Springer-Verlag, N.Y., 1987); 
and "Handbook Of Experimental Immunology", volumes I-IV (Weir and Blackwell, 
eds, 1986). 
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Both prokaryotic and eukaryotic host cells are useful for expressing 
desired coding sequences when appropriate control sequences compatible with the des- 
ignated host are used. Among prokaryotic hosts, E. coli is most frequently used. 
Expression control sequences for prokaryotes include promoters, optionally containing 
operator portions, and ribosome binding sites. Transfer vectors compatible with 
prokaryotic hosts are commonly derived from, for example, pBR322, a plasmid 
containing operons conferring ampicillin and tetracycline resistance, and the various 
pUC vectors, which also contain sequences conferring antibiotic resistance markers. 
These plasmids are commercially available. The markers may be used to obtain suc- 
cessful transformants by selection. Commonly used prokaryotic control sequences 
include the p-lactamase (penicillinase) and lactose promoter systems (Chang et al, 
Nature (1977) 198:1056), the tryptophan (trp) promoter system (Goeddel et al, Nuc 
Acids Res (1980) 8:4057) and the lambda-derived P L promoter and N gene ribosome 
binding site (Shimatake et al, Nature (1981) 292: 128) and the hybrid tac promoter (De 
Boer et al, Proc Nat Acad Sci USA (1983) 292:128) derived from sequences of the ttp 
and lac UV5 promoters. The foregoing systems are particularly compatible with 2?. 
coli; if desired, other prokaryotic hosts such as strains of Bacillus or Pseudomonas 
may be used, with corresponding control sequences. 

Eukaryotic hosts include without limitation yeast and mammalian cells 
in culture systems. Yeast expression hosts include Saccharomyces, Klebsiella, Picia, 
and the like. Saccharomyces cerevisiae and Saccharomyces carlsbergensis and K. 
lactis are the most commonly used yeast hosts, and are convenient fungal hosts. 
Yeast-compatible vectors carry markers which permit selection of successful trans- 
formants by conferring prototrophy to auxotrophic mutants or resistance to heavy 
metals on wild-type strains. Yeast compatible vectors may employ the 2p origin of 
replication (Broach et al, Meth Enzymol (1983) 101:307), the combination of CEN3 
and ARS 1 or other means for assuring replication, such as sequences which will result 
in incorporation of an appropriate fragment into the host cell genome. Control 
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sequences for yeast vectors are known in the art and include promoters for the 
synthesis of glycolytic enzymes (Hess et al, J Adv Enzyme Reg (1968) 7:149; 
Holland et al, Biochem (1978), 17:4900), including the promoter for 3-phospho- 
glycerate kinase (R. Hitzeman et al, J Biol Chem (1980) 255:2073). Terminators may 
also be included, such as those derived from the enolase gene (Holland, J Biol Chem 
(1981) 256: 1385). Particularly useful control systems are those which comprise the 
glyceraldehyde-3 phosphate dehydrogenase (QAPDH) promoter or alcohol dehydrogen- 
ase (ADH) regulatable promoter, terminators also derived from GAPDH, and if 
secretion is desired, a leader sequence derived from yeast a-f actor (see U.S. Pat. No. 
4,870,008, incorporated herein by reference). 

A presently preferred expression system employs the ubiquitin leader as 
the fusion partner. Copending application USSN 7/390,599 filed 7 August 1989 
disclosed vectors for high expression of yeast ubiquitin fusion proteins. Yeast 
ubiquitin provides a 76 amino acid polypeptide which is automatically cleaved from 
the fused protein upon expression. The ubiquitin amino acid sequence is as follows: 

Gin He Phe Val Lys Thr Leu Hir Gly Lys Thr He Thr 
Leu Glu Val Glu Ser Ser Asp Thr lie Asp Asn Val Lys 
Ser Lys lie Gin Asp Lys Glu Gly lie Pro Pro Asp Gin 
Gin Arg Leu lie Phe Ala Gly Lys Gin Leu Glu Asp Gly 
Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu Ser Thr 
Leu His Leu Val Leu Arg Leu Arg Gly Gly 

See also Ozkaynak et al, Nature (1984) 312:663-66. Polynucleotides 
encoding the ubiquitin polypeptide may be synthesized by standard methods, for 
example following the technique of Barr et al, J Biol Chem (1988) 268: 1671-78 using 
an Applied Biosystem 380A DNA synthesizer. Using appropriate linkers, the 
ubiquitin gene may be inserted into a suitable vector and ligated to a sequence 
encoding the HCV protease or a fragment thereof. 

In addition, the transcriptional regulatory region and the transcriptional 
initiation region which are operably linked may be such that they are not naturally 
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associated in the wild-type organism. These systems are described in detail in EPO 
120,551, published October 3, 1984; EPO 116,201, published August 22, 1984; and 
EPO 164,556, published December 18, 1985, all of which are commonly owned with 
the present invention, and are hereby incorporated herein by reference in full. 

Mammalian cell lines available as hosts for expression are known in the 
art and include many immortalized cell lines available from the American Type 
Culture Collection (ATCC), including HeLa cells, Chinese hamster ovary (CHO) cells, 
baby hamster kidney (BHK) cells, and a number of other cell lines. Suitable 
promoters for mammalian cells are also known in the art and include viral promoters 
such as that from Simian Virus 40 (SV40) (Fiers et al, Nature (1978) 273:113), Rous 
sarcoma virus (RS V), adenovirus (ADV), and bovine papilloma virus (BPV). 
Mammalian cells may also require terminator sequences and poly-A addition 

sequences. Enhancer sequences which increase expression may also be included, and 
sequences which promote amplification of the gene may also be desirable (for example 
methotrexate resistance genes). These sequences are known in the art. 

Vectors suitable for replication in mammalian cells are known in the art, 
and may include viral replicons, or sequences which insure integration of the appro- 
priate sequences encoding HCV epitopes into the host genome. For example, another 
vector used to express foreign DNA is Vaccinia virus. In this case the heterologous 
DNA is inserted into the Vaccinia genome. Techniques for the insertion of foreign 
DNA into the vaccinia virus genome are known in the art, and may utilize, for 
example, homologous recombination. The heterologous DNA is generally inserted 
into a gene which is non-essential to the virus, for example, the thymidine kinase gene 
(tk), which also provides a selectable marker. Plasmid vectors that greatly facilitate 
the construction of recombinant viruses have been described (see, for example, 
Mackett et al, J Virol (1984) 49:857; Chakrabarti et al, Mol Cell Biol (1985) 5:3403; 
Moss, in GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (Miller and 
Calos, eds., Cold Spring Harbor Laboratory, NY, 1987), p. 10). Expression of the 
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HCV polypeptide then occurs in cells or animals which are infected with the live 
recombinant vaccinia virus. 

In order to detect whether or not the HCV polypeptide is expressed 
from the vaccinia vector, BSC 1 cells may be infected with the recombinant vector 
and grown on microscope slides under conditions which allow expression. The cells 
may then be acetone-fixed, and immunofluorescence assays performed using serum 
which is known to contain anti-HCV antibodies to a polypeptide® encoded in the 
region of the HCV genome from which the HCV segment in the recombinant 
expression vector was derived. 

Other systems for expression of eukaryotic or viral genomes include 
insect cells and vectors suitable for use in these cells. These systems are known in the 
art, and include, for example, insect expression transfer vectors derived from the 
baculovirus Autographa calif ornica nuclear polyhedrosis virus (AcNPV), which is a 
helper-independent, viral expression vector. Expression vectors derived from this 
system usually use the strong viral polyhedrin gene promoter to drive expression of 
heterologous genes. Currently the most commonly used transfer vector for introducing 
foreign genes into AcNPV is pAc373 (see PCT WO89/046699 and USSN 7/456,637). 
Many other vectors known to those of skill in the art have also been designed for 
improved expression. These include, for example, pVL985 (which alters the 
polyhedrin start codon from ATG to ATT, and introduces a BamHI cloning site 32 bp 
downstream from the ATT; See Luckow and Summers, Virol (1989) 17:31). AcNPV 
transfer vectors for high level expression of nonfused foreign proteins are described in 
copending applications PCT WO89/046699 and USSN 7/456,637. A unique BamHI 
site is located following position -8 with respect to the translation initiation codon 
ATG of the polyhedrin gene. There are no cleavage sites for Smal, PstI, Bgin, Xbal 
or SstL Good expression of nonfused foreign proteins usually requires foreign genes 
that ideally have a short leader sequence containing suitable translation initiation 
signals preceding an ATG start signal. The plasmid also contains the polyhedrin poly- 
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adenylation signal and the ampicillin-resistance (amp) gene and origin of replication 
for selection and propagation in E. colt 

Methods for the introduction of heterologous DNA into the desired site 
in the baculovirus virus are known in the art (See Summer and Smith, Texas 
Agricultural Experiment Station Bulletin No. 1555; Smith et al, Mol Cell Biol (1983) 
3:2156-2165; and Luckow and Summers, Virol (1989) 17:31). For example, the 
heterologous DNA can be inserted into a gene such as the polyhedrin gene by homol- 
ogous recombination, or into a restriction enzyme site engineered into the desired 
baculovirus gene. The inserted sequences may be those which encode all or varying 
segments of the polyprotein, or other orfs which encode viral polypeptides. For 
example, the insert could encode the following numbers of amino acid segments from 
the polyprotein: amino acids 1-1078; amino acids 332-662; amino acids 406-662; 
amino acids 156-328, and amino acids 199-328. 

The signals for post-translational modifications, such as signal peptide 
cleavage, proteolytic cleavage, and phosphorylation, appear to be recognized by insect 
cells. The signals required for secretion and nuclear accumulation also appear to be 
conserved between the invertebrate cells and vertebrate cells. Examples of the signal 
sequences from vertebrate cells which are effective in invertebrate cells are known in 
the art, for example, the human interleukin-2 signal (BL2 ) which signals for secretion 
from the cell, is recognized and properly removed in insect cells. 

Transformation may be by any known method for introducing polynuc- 
leotides into a host cell, including, for example packaging the polynucleotide in a virus 
and transducing a host cell with the virus, and by direct uptake of the polynucleotide. 
The transformation procedure used depends upon the host to be transformed. Bacterial 
transformation by direct uptake generally employs treatment with calcium or rubidium 
chloride (Cohen, Proc Nat Acad Sci USA (1972) 69:2110; T. Maniatis et al, 
"Molecular Cloning; A Laboratory Manual" (Cold Spring Harbor Press, Cold Spring 
Harbor, NY, 1982). Yeast transformation by direct uptake may be carried out using 
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the method of Hinnen et al, Proc Nat Acad Sci USA (1978) 75:1929. Mammalian 
transformations by direct uptake may be conducted using the calcium phosphate 
precipitation method of Graham and Van der Eb, Virol (1978) 52:546, or the various 
known modifications thereof. Other methods for introducing recombinant 

polynucleotides into cells, particularly into mammalian cells, include dextran-mediated 
transfection, calcium phosphate mediated transfection, polybrene mediated transfection, 
protoplast fusion, electroporation, encapsulation of the polynucleotide^) in liposomes, 
and direct microinjection of the polynucleotides into nuclei. 

Vector construction employs techniques which are known in the art. 
Site-specific DNA cleavage is performed by treating with suitable restriction enzymes 
under conditions which generally are specified by the manufacturer of these 
commercially available enzymes. In general, about 1 pg of plasmid or DNA sequence 
is cleaved by 1 unit of enzyme in about 20 pL buffer solution by incubation for 1-2 hr 
at 37°C. After incubation with the restriction enzyme, protein is removed by 
phenol/chloroform extraction and the DNA recovered by precipitation with ethanoL 
The cleaved fragments may be separated using polyacrylamide or agarose gel 
electrophoresis techniques, according to the general procedures described in Meth 
Enzvmol (1980) 65:499-560. 

Sticky-ended cleavage fragments may be blunt ended using E. coli DNA 
polymerase I (Klenow fragment) with the appropriate deoxynucleotide triphosphates 
(dNTPs) present in the mixture. Treatment with S 1 nuclease may also be used, 
resulting in the hydrolysis of any single stranded DNA portions. 

Ligations are carried out under standard buffer and temperature con- 
ditions using T4 DNA ligase and ATP; sticky end ligations require less ATP and less 
ligase than blunt end ligations. When vector fragments are used as part of a ligation 
mixture, the vector fragment is often treated with bacterial alkaline phosphatase (BAP) 
or calf intestinal alkaline phosphatase to remove the 5' -phosphate, thus preventing 
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religation of the vector. Alternatively, restriction enzyme digestion of unwanted frag- 
ments can be used to prevent ligation. 

Ligation mixtures are transformed into suitable cloning hosts, such as E. 
colU and successful transformants selected using the markers incorporated (e.g., 
antibiotic resistance), and screened for the correct construction. 

Synthetic oligonucleotides may be prepared using an automated oligo- 
nucleotide synthesizer as described by Warner, DNA (1984) 3:401. If desired, the 
synthetic strands may be labeled with by treatment with polynucleotide kinase in 
the presence of 32 P-ATP under standard reaction conditions. 

DNA sequences, including those isolated from cDNA libraries, may be 
modified by known techniques, for example by site directed mutagenesis (see e.g., 
Zoller, Nuc Acids Res (1982) 10:6487). Briefly, the DNA to be modified is packaged 
into phage as a single stranded sequence, and converted to a double stranded DNA 
with DNA polymerase, using as a primer a synthetic oligonucleotide complementary to 
the portion of the DNA to be modified, where the desired modification is included in 
the primer sequence. The resulting double stranded DNA is transformed into a phage- 
supporting host bacterium. Cultures of the transformed bacteria which contain copies 
of each strand of the phage are plated in agar to obtain plaques. Theoretically, 50% of 
the new plaques contain phage having the mutated sequence, and the remaining 50% 
have the original sequence. Replicates of the plaques are hybridized to labeled 
synthetic probe at temperatures and conditions which permit hybridization with the 
correct strand, but not with the unmodified sequence. The sequences which have been 
identified by hybridization are recovered and cloned. 

DNA libraries may be probed using the procedure of Grunstein and 
Hogness Proc Nat Acad Sci USA (1975) 73:3961. Briefly, in this procedure the DNA 
to be probed is immobilized on nitrocellulose filters, denatured, and prehybridized with 
a buffer containing 0-50% formamide, 0.75 M NaCl, 75 mM Na citrate, 0.02% (wt/v) 
each of bovine serum albumin, polyvinylpyrrolidone, and Ficoll®, 50 mM Nal^PC^ 
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(pH 6.5), 0.1% SDS, and 100 ]ig/mL carrier denatured DNA. The percentage of 
formamide in the buffer, as well as the time and temperature conditions of the 
prehybridization and subsequent hybridization steps depend on the stringency required. 
Oligomeric probes which require lower stringency conditions are generally used with 
low percentages of formamide, lower temperatures, and longer hybridization times. 
Probes containing more than 30 or 40 nucleotides, such as those derived from cDNA 
or genomic sequences generally employ higher temperatures, e.g., about 40-42°C, and 
a high percentage formamide, e.g., 50%. Following prehybridization, S'-^P-labeled 
oligonucleotide probe is added to the buffer, and the filters are incubated in this 
mixture under hybridization conditions. After washing, the treated filters are subjected 

to autoradiography to show the location of the hybridized probe; DNA in correspond- 
ing locations on the original agar plates is used as the source of the desired DNA. 

For routine vector constructions, ligation mixtures are transformed into 
E. coli strain HB101 or other suitable hosts, and successful transformants selected by 
antibiotic resistance or other markers. Plasmids from the transformants are then 
prepared according to the method of Clewell et al, Proc Nat Acad Sci USA (1969) 
62:1159, usually following chloramphenicol amplification (Clewell, J Bacteriol (1972) 

110: 667). The DNA is isolated and analyzed, usually by restriction enzyme analysis 
and/or sequencing. Sequencing may be performed by the dideoxy method of Sanger et 
al, Proc Nat Acad Sci USA (1977) 74:5463, as further described by Messing et al, 
Nuc Acids Res (1981) 9:309, or by the method of Maxam et al, Meth Enzvmol (1980) 
65:499. Problems with band compression, which are sometimes observed in GC-rich 
regions, were overcome by use of T-deazoguanosine according to Barr et al, 
Biotechniques (1986) 4:428. 

The enzyme-linked immunosorbent assay (ELISA) can be used to meas- 
ure either antigen or antibody concentrations. This method depends upon conjugation 
of an enzyme to either an antigen or an antibody, and uses the bound enzyme activity 
as a quantitative label. To measure antibody, the known antigen is fixed to a solid 
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phase (e.g., a microliter dish, plastic cup, dipstick, plastic bead, or the like), incubated 
with test serum dilutions, washed, incubated with antiimmunoglobulin labeled with an 
enzyme, and washed again. Enzymes suitable for labeling are known in the art, and 
include, for example, horseradish peroxidase (HRP). Enzyme activity bound to the 
solid phase is usually measured by adding a specific substrate, and determining prod- 
uct formation or substrate utilization colorimetrically. The enzyme activity bound is a 
direct function of the amount of antibody bound. 

To measure antigen, a known specific antibody is fixed to the solid 
phase, the test material containing antigen is added, after an incubation the solid phase 
is washed, and a second enzyme-labeled antibody is added. After washing, substrate 
is added, and enzyme activity is measured colorimetrically, and related to antigen 
concentration. 

Proteases of the invention may be assayed for activity by cleaving a 
substrate which provides detectable cleavage products. As the HCV protease is 
believed to cleave itself from the genomic polyprotein, one can employ this autocata- 
lytic activity both to assay expression of the protein and determine activity. For 
example, if the protease is joined to its fusion partner so that the HCV protease N- 
terminal cleavage signal (Arg-Arg) is included, the expression product will cleave 
itself into fusion partner and active HCV protease. One may then assay the products, 
for example by western blot, to verify that the proteins produced correspond in size to 
the separate fusion partner and protease proteins. It is presently preferred to employ 
small peptide p-nitrophenyl esters or methylcoumarins, as cleavage may then be 
followed by spectrophotometric or fluorescent assays. Following the method described 
by E.D. Matayoshi et al, Science (1990) 247:231-35, one may attach a fluorescent 
label to one end of the substrate and a quenching molecule to the other end: cleavage 
is then determined by measuring the resulting increase in fluorescence. If a suitable 
enzyme or antigen has been employed as the fusion partner, the quantity of protein 
produced may easily be determined. Alternatively, one may exclude the HCV 
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protease N-terminal cleavage signal (preventing self-cleavage) and add a separate 
cleavage substrate, such as a fragment of the HCV NS3 domain including the native 
processing signal or a synthetic analog. 

In the absence of this protease activity, the HCV polyprotein should 
remain in its unprocessed form, and thus render the virus noninfectious. Thus, the 
protease is useful for assaying pharmaceutical agents for control of HCV, as com- 
pounds which inhibit the protease activity sufficiently will also inhibit viral infectivity. 
Such inhibitors may take the form of organic compounds, particularly compounds 
which mimic the cleavage site of HCV recognized by the protease. Three of the 
putative cleavage sites of the HCV polyprotein have the following amino acid 
sequences: 

Val-Ser-Ala-Arg-Arg // Gly-Arg-Glu-ne-Leu-Leu-Gly 1 
Ala-Ile-Leu-Arg-Arg // His-Val-Gly-Pro- 
Val-Ser-Cys-Gln-Arg // Gly-Tyr- 

These sites are characterized by the presence of two basic amino acids 
immediately before the cleavage site, and are similar to the cleavage sit&s recognized 
by other flavivirus proteases. Thus, suitable protease inhibitors may be prepared 
which mimic the basic/basic/small neutral motif of the HCV cleavage sites, but 
substituting a nonlabUe linkage for the peptide bond cleaved in the natural substrate. 
Suitable inhibitors include peptide trifluoromethyl ketones, peptide boronic acids, 
peptide a-ketoesters, peptide difluoroketo compounds, peptide aldehydes, peptide 
diketones, and the like. For example, the peptide aldehyde N-acetyl-phenylalanyl- 
glycinaldehyde is a potent inhibitor of the protease papain. One may conveniently 
prepare and assay large mixtures of peptides using the methods disclosed in U.S. 
Patent application Serial No. 7/189,318, filed 2 May 1988 (published as PCT 
WO89/10931), incorporated herein by reference. This application teaches methods for 
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generating mixtures of peptides up to hexapeptides having all possible amino acid 
sequences, and further teaches assay methods for identifying those peptides capable of 
binding to proteases. 

Other protease inhibitors may be proteins, particularly antibodies and 
5 antibody derivatives. Recombinant expression systems may be used to generate 
quantities of protease sufficient for production of monoclonal antibodies (MAbs) 
specific for the protease. Suitable antibodies for protease inhibition will bind to the 
protease in a manner reducing or eliminating the enzymatic activity, typically by 
obscuring the active site. Suitable MAbs may be used to generate derivatives, such as 

10 Fab fragments, chimeric antibodies, altered antibodies, univalent antibodies, and single 
domain antibodies, using methods known in the art. 

Protease inhibitors are screened using methods of the invention. In gen- 
eral, a substrate is employed which mimics the enzyme's natural substrate, but which 
provides a quantifiable signal when cleaved. The signal is preferably detectable by 

15 colorimetric or fluorometric means: however, other methods such as HPLC or silica 

gel chromatography, GC-MS, nuclear magnetic resonance, and the like may also be 
useful. After optimum substrate and enzyme concentrations are determined, a 
candidate protease inhibitor is added to the reaction mixture at a range of 
concentrations. The assay conditions ideally should resemble the conditions under 

20 which the protease is to be inhibited in vivo, i.e., under physiologic pH, temperature, 
ionic strength, etc. Suitable inhibitors will exhibit strong protease inhibition at con- 
centrations which do not raise toxic side effects in the subject Inhibitors which 
compete for binding to the protease active site may require concentrations equal to or 
greater than the substrate concentration, while inhibitors capable of binding irrev- 

2 5 ersibly to the protease active site may be added in concentrations on the order of the 
enzyme concentration. 

In a presently preferred embodiment, an inactive protease mutein is 
employed rather than an active enzyme. It has been found that replacing a critical 
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residue within the active site of a protease (e.g., replacing the active site Ser of a 
serine protease) does not significantly alter the structure of the enzyme, and thus 
preserves the binding specificity. The altered enzyme still recognizes and binds to its 
proper substrate, but fails to effect cleavage. Thus, in one method of the invention an 
inactivated HCV protease is immobilized, and a mixture of candidate inhibitors added 
Inhibitors that closely mimic the enzyme's preferred recognition sequence will 
compete more successfully for binding than other candidate inhibitors. The poorly- 
binding candidates may then be separated, and the identity of the strongly-binding 
inhibitors determined For example, HCV protease may be prepared substituting Ala 
for Ser^x (Fig. 1), providing an enzyme capable of binding the HCV protease sub- 
strate, but incapable of cleaving it. The resulting protease mutein is then bound to a 
solid support, for example Sephadex® beads, and packed into a column. A mixture of 
candidate protease inhibitors in solution is then passed through the column and 
fractions collected. The last fractions to elute will contain the strongest-binding 
compounds, and provide the preferred protease inhibitor candidates. 

Protease inhibitors may be administered by a variety of methods, such 
as intravenously, orally, intramuscularly, intraperitoneally, bronchially, intranasally, 
and so forth. The preferred route of administration will depend upon the nature of the 
inhibitor. Inhibitors prepared as organic compounds may often be administered orally 
(which is generally preferred) if well absorbed. Protein-based inhibitors (such as most 
antibody derivatives) must generally be administered by parenteral routes. 

C. Examples 

The examples presented below are provided as a further guide to the 
practitioner of ordinary skill in the art, and are not to be construed as limiting the 
invention in any way. 
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Example 1 
(Preparation of HCV cDNA) 
A genomic library of HCV cDNA was prepared as described in PCT 
WO89/046699 and USSN 7/456,637. This library, ATCC accession no. 40394, has 

5 been deposited as set forth below. 

Example 2 

(Expression of the Polypeptide Encoded in Clone 5-1-1.) 
(A) The HCV polypeptide encoded within clone 5-1-1 (see 

10 Example 1) was expressed as a fusion -polypeptide with human superoxide dismutase 
(SOD). This was accomplished by subcloning the clone 5-1-1 cDNA insert into the 
expression vector pSODCFl (K.S. Steimer et al, J Virol (1986) 58:9; EPO 138,111) 
as follows. The SOD/5-1-1 expression vector was transformed into E. coli D1210 
cells. These cells, named Cf 1/5-1-1 in E. colt, were deposited as set forth below and 
±5 have an ATCC accession no. of 67967. 

First, DNA isolated from pSODCFl was treated with BamHI and 
EcoRI, and the following linker was ligated into the linear DNA created by the 
restriction enzymes: 

GAT CCT GGA ATT CTG ATA AGA CCT TAA GAC TAT TTT AA 

20 

After cloning, the plasmid containing the insert was isolated. 

Plasmid containing the insert was restricted with EcoRL The HCV 
cDNA insert in clone 5-1-1 was excised with EcoRI, and ligated into this EcoRI lin- 
earized plasmid DNA. The DNA mixture was used to transform E. coli strain D1210 
5 (Sadler et al, Gene (1980) 8:279). Recombinants with the 5-1-1 cDNA in the correct 
orientation for expressing the ORF shown in Figure 1 were identified by restriction 
mapping and nucleotide sequencing. 

Recombinant bacteria from one clone were induced to express the SOD- 
HCV<- , « polypeptide by growing the bacteria in the presence of IPTG. 
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Three separate expression vectors, pcflAB, pcflCD, and pcflEF were 
created by ligating three new linkers, AB, CD, and EF to a BamHI-EcoRI fragment 
derived by digesting to completion the vector pSODCFl with EcoRI and BamHI, 
followed by treatment with alkaline phosphatase. The linkers were created from six 
oligomers, A, B, C, D, E, and F. Each oligomer was phosphorylated by treatment 
with kinase in the presence of ATP prior to annealing to its complementary oligomer. 
The sequences of the synthetic linkers were the following: 

Name DNA Sequence (5' to 3 7 ) 



A 


GATC 


CTG 


AAT 


TCC 


TGA 


TAA 


B 




GAC 


'1TA 


AGG 


ACT 


ATTTTA A 


C 


GATC 


CGA 


ATT 


CTG 


TGA 


TAA 


D 




GCT 


TAA 


GAC 


ACT 


ATTTTA A 


E 


GATC 


CTG 


GAA 


TTC 


TGA 


TAA 


F 




GAC 


CTT 


AAG 


ACT 


ATT TTA A 



Each of the three linkers destroys the original EcoRI site, and creates a 
new EcoRI site within the linker, but within a different reading frame. Thus, the HCV 
cDNA EcoRI fragments isolated from the clones, when inserted into the expression 
vector, were in three different reading frames. 

The HCV cDNA fragments in the designated Xgtll clones were excised 
by digestion with EcoRI; each fragment was inserted into pcflAB, pcflCD, and 
pcflEF. These expression constructs were then transformed into D1210 E. coli cells, 
the transformants cloned, and polypeptides expressed as described in part B below. 

(B) Expression products of the indicated HCV cDNAs were 

tested for antigenicity by direct immunological screening of the colonies, using a 
modification of the method described in Helfman et al, Proc Nat Acad Sci USA 
(1983), 80:31. Briefly, the bacteria were plated onto nitrocellulose filters overlaid on 
ampicillin plates to give approximately 40 colonies per filter. Colonies were replica 
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plated onto nitrocellulose filters, and the replicas were regrown overnight in the pres- 
ence of 2 mM IPTG and ampicillin. The bacterial colonies were lysed by suspending 
the nitrocellulose filters for about 15 to 20 min in an atmosphere saturated with 
CHCl^ vapor. Each filter then was placed in an individual 100 mm Petri dish contain- 
ing 10 mL of 50 mM Tris HC1, pH 7.5, 150 mM NaCl, 5 mM MgC^, 3% (w/v) BSA, 
40 jig/mL lysozyme, and 0.1 ]ig/mL DNase. The plates were agitated gently for at 
least 8 hours at room temperature. The filters were rinsed in TBST (50 mM Tris HC1, 
pH 8.0, 150 mM NaCl, 0.005% Tween® 20). After incubation, the cell residues were 
rinsed and incubated for one hour in TBS (TBST without Tween®) containing 10% 
sheep serum. The filters were then incubated with pretreated sera in TBS from 
individuals with NANBH, which -included 3 chimpanzees; 8 patients with chronic 
NANBH whose sera were positive with respect to antibodies to HCV C100-3 
polypeptide (also called C100); 8 patients with chronic NANBH whose sera were 
negative for anti-ClOO antibodies; a convalescent patient whose serum was negative 
for anti-ClOO antibodies; and 6 patients with community-acquired NANBH, including 
one whose sera was strongly positive with respect to anti-ClOO antibodies, and one 
whose sera was marginally positive with respect to anti-ClOO antibodies. The sera, 
diluted in TBS, was pretreated by preabsorption with hSOD for at least 30 minutes at 
37°C. After incubation, the filters were washed twice for 30 min with TBST. The 
expressed proteins which bound antibodies in the sera were labeled by incubation for 2 
hours with l25 I-labeled sheep anti-human antibody. After washing, the filters were 
washed twice for 30 min with TBST, dried, and autoradiographed. 

Example 3 

(Cloning of Full-Length SOD-Protease Fusion Proteins) 
(A) PBR322-C200: 

The nucleotide sequences of the HCV cDNAs used below were deter- 
mined essentially as described above, except that the cDNA excised from these phages 
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were substituted for the cDNA isolated from clone 5-1-1. 

Clone C33c was isolated using a hybridization probe having the follow- 
ing sequence: 

5' ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 3' 
The sequence of the HCV cDNA in clone C33c is shown in Figure 8, which also 
shows the amino acids encoded therein. 

Clone 35 was isolated by screening with a synthetic polynucleotide hav- 
ing the sequence: 

5' AAG CCA CCG TGT GCG CTA GGG CTC AAG CCC 3' 
Approximately 1 in 50,000 clones hybridized with the probe. The polynucleotide and 
deduced amino acid sequences for C35 are shown in Figure 7. 

Clone C31 is shown in Figure 6, which also shows the amino acids 
encoded therein, A C200 cassette was constructed by ligating together a 718 bp frag- 
ment obtained by digestion of clone C33c DNA with EcoRI and Hinfl, a 179 bp 
fragment obtained by digestion of clone C31 DNA with Hinfl and Bgll, and a 377 bp 
fragment obtained by digesting clone C35 DNA with Bgll and EcoRI. The construct 
of ligated fragments were inserted into the EcoRI site of pBR322, yielding the plasmid 
PBR322-C200. 

(B) C7f+C20c: 

Clone 7f was isolated using a probe having the sequence: 
5'-AGC AGA CAA GGG GCC TCC TAG GGT GCA TAA T-3' 
The sequence of HCV cDNA in clone 7f and the amino acids encoded therein are 
shown in Figure 5. 

Clone C20c is isolated using a probe having the following sequence: 

5'-TGC ATC AAT GGG GTG TGC TGG-3 ' 
The sequence of HCV cDNA in clone C20c, and the amino acids 
encoded therein are shown in Figure 2. 
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Clones 7f and C20c were digested with EcoRI and SfaNI to form 400 
bp and 260 bp fragments, respectively. The fragments were then cloned into the 
EcoRI site of pBR322 to form the vector C7f+C20c, and transformed into HB101 
cells, 

(C) C300 : 

Clone 8h was isolated using a probe based on the sequence of nucleo- 
tides in clone 33c. The nucleotide sequence of the probe was 

5'-AGA GAC AAC CAT GAG GTC CCC GGT GTT C-3\ 
The sequence of the HCV cDNA in clone 8h, and the amino acids encoded therein, 
are shown in Figure 4. 

Clone C26d is isolated using a probe having the following sequence: 

5'-CTG TTG TGC CCC GCG GCA GCC-3' 
The sequence and amino acid translation of clone C26d is shown in 

Figure 3. 

Clones C26d and C33c (see part A above) were transformed into the 
methylation minus E. coli strain GM48. Clone C26d was digested with EcoRII and 
Ddel to provide a 100 bp fragment. Clone C33c was digested with EcoRII and EcoRI 
to provide a 700 bp fragment. Clone C8h was digested with EcoRI and Ddel to 
provide a 208 bp fragment These three fragments were then ligated into the EcoRI 
site of pBR322, and transformed into E. coli HB101, to provide the vector C300. 

(D) Preparation of Full Length Clones: 

A 600 bp fragment was obtained from C7f+C20c by digestion with 
EcoRI and Nael, and ligated to a 945 bp Nael/EcoRI fragment from C300, and the 
construct inserted into the EcoRI site of pGEM4Z (commercially available from 
Promega) to form the vector C7fC20cC300. 

C7fC20cC300 was digested with Ndel and EcoRI to provide a 892 bp 
fragment, which was ligated with a 1160 bp fragment obtained by digesting C200 with 
Ndel and EcoRI. The resulting construct was inserted into the EcoRI site of pBR322 
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to provide the vector C7fC20cC300C200. Construction of this vector is illustrated 
schematically in Figure 9. 

Example 4 
(Preparation of E. coli Expression Vectors) 

(A) cflSODp600: 

This vector contains a full-length HCV protease coding sequence fused 
to a functional hSOD leader. The vector C7fC20cC300C200 was cleaved with EcoRI 
to provide a 2000 bp fragment, which was then ligated into the EcoRI site of plasmid 

cflCD (Example 2A). The resulting vector encodes amino acids 1-151 of hSOD, and 
amino acids 946-1630 of HCV (numbered from the beginning of the polyprotein, cor- 
responding to amino acids 1-686 in Figure 1). The vector was labeled cflSODp600 
(sometimes referred to as P600), and was transformed into E. coli D1210 cells. These 
cells, ATCC accession no. 68275, were deposited as set forth below. 

(B) P190: 

A truncated SOD-protease fusion polynucleotide was prepared by excis- 
ing a 600 bp EcoRI/Nael fragment from C7f+C20c, blunting the fragment with 
Klenow fragment, ligating the blunted fragment into the Klenow-blunted EcoRI site of 
cf 1EF (Example 2A). This polynucleotide encodes a fusion protein having amino 
acids 1-151 of hSOD, and amino acids 1-199 of HCV protease, 

(C) P300: 

A longer truncated SOD-protease fusion polynucleotide was prepared by 
excising an 892 bp EcoRI/Ndel fragment from C7fC20cC300, blunting the fragment 
with Klenow fragment, ligating the blunted fragment into the Klenow-blunted EcoRI 
site of cflEF. This polynucleotide encodes a fusion protein having amino acids 1-151 
of hSOD, and amino acids 1-299 of HCV protease. 



0100.002 
PATENT 

* 

r 30 - 

(D) P500: 

A longer truncated SOD-protease fusion polynucleotide was prepared by 
excising a 1550 bp EcoRI/EcoRI fragment from C7fC20cC300, and ligating the 
fragment into the EcoRI site of cflCD to form P500. This polynucleotide encodes a 
fusion protein having amino acids 1-151 of hSOD, and amino acids 946-1457 of HCV 
protease (amino acids 1-513 in Figure 1). 

(E) FLAG/Protease Fusion 

This vector contains a full-length HCV protease coding sequence fused 
to the FLAG sequence, Hopp et aL (1988) Biotechnology 6: 1204-1210. PCR was 
used to produce a HCV protease gene with special restriction ends for cloning ease. 
Plasmid p500 was digested with EcoRI a[nd Ndel to yield a 900 bp fragment This 
fragment and two primers were used in a polymerase chain reaction to introduce a 
unique Bgin site at amino acid 1009 and a stop codon with a Sail site at amino acid 
1262 of the HCV-1, as shown in Figure 17 of WO 90/11089, published 4 October 
1990. The sequence of the primers is as follows: 

5' CCC GAG CAA GAT CTC CCG GCC C 3' 
and 

5' CCC GGC TGC ATA AGC AGT CGA CTT GGA 3' 
After 30 cycles of PCR, the reaction was digested with Bgin and Sail, and the 710 bp 
fragment was isolated. This fragment w»s annealed and ligated to the following 

duplex: 

MetAspTyrLysAspAspAspAspLysGlyArgGlu 
CATGGACTACAAAGACGATGACGATAAAGGCCGGGA 

CTGATGTTTCTGCTACTGCTATTTCCGGCCCTCTAG 

The duplex encodes the FLAG sequence, and initiator methionine, and a 5' Ncol 
restriction site. The resulting Ncol/Sall fragment was ligated into a derivative of 
pCFl. 

This construct is then transformed into E. coli D1210 cells and expression of 
the protease is induced by the addition of BPTG. 
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The FLAG sequence was fused to the HCV protease to facilitate purification. 
A calcium dependent monoclonal antibody, which binds to the FLAG encoded peptide, 
is used to purify the fusion protein without harsh eluting conditions. 

Example 5 

(£. coli Expression of SOD-Protease Fusion Proteins) 

(A) E. coli D1210 cells were transformed with cflSODp600 and grown in 
Luria broth containing 100 pg/mL ampicillin to an OD of 0.3-0.5. BPTG was then 
added to a concentration of 2 mM, and the cells cultured to a final OD of 0.9 to 1.3. 
The cells were then lysed, and the lysate analyzed by Western blot using anti-HCV 
sera, as described in USSN 7/456,637. 

Hie results indicated the occurrence of cleavage, as no full length product 
(theoretical Mr 93 kDa) was evident on the gel. Bands corresponding to the hSOD 
fusion partner and the separate HCV protease appeared at relative molecular weights 
of about 34, 53, and 66 kDa. The 34 kDa band corresponds to the hSOD partner 
(about 20 kDa) with a portion of the NS3 domain, while the 53 and 66 kDa bands cor- 
respond to HCV protease with varying degrees of (possibly bacterial) processing. 

(B) E. coli D1210 cells were transformed with P500 and grown in Luria 
broth containing 100 pg/mL ampicillin to an OD of 0.3-0.5. IPTG was then added to 
a concentration of 2 mM, and the cells cultured to a final OD of 0.8 to L0. The cells 
were then lysed, and the lysate analyzed as described above. 

The results again indicated the occurrence of cleavage, as no full length 
product (theoretical Mr 73 kDa) was evident on the gel. Bands corresponding to the 
hSOD fusion partner and the truncated HCV protease appeared at molecular weights 
of about 34 and 45 kDa, respectively. 

(C) E. coli D1210 cells were transformed with vectors P300 and P190 and 
grown as described above. 
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The results from P300 expression indicated the occurrence of cleavage, as no 
full length product (theoretical Mr 51 kDa) was evident on the geL A band 
corresponding to the hSOD fusion partner appeared at a relative molecular weight of 
about 34. The corresponding HCV protease band was not visible, as this region of the 
NS3 domain is not recognized by the sera employed to detect the products. However, 
appearance of the hSOD band at 34 kDa rather than 51 kDa indicates that cleavage 
occurred. 

The P190 expression product appeared only as the full (encoded) length 
product without cleavage, forming a band at about 40 kDa, which corresponds to the 
theoretical molecular weight for the uncleaved product. This may indicate that the 
minimum essential sequence for HCV protease extends to the region between amino 
acids 199 and 299. 

Example 6 

(Purification of E. coli Expressed Protease) 
The HCV protease and fragments expressed in Example 5 may be purified as 
follows: 

The bacterial cells in which the polypeptide was expressed are subjected to 
osmotic shock and mechanical disruption, the insoluble fraction containing the protease 
is isolated and subjected to differential extraction with an alkaline-NaCl solution, and 
the polypeptide in the extract purified by chromatography on columns of S- 
Sepharose® and Q-Sepharose®. 

The crude extract resulting from osmotic shock and mechanical disruption is 
prepared by suspending 1 g of the packed cells in 10 mL of a solution containing 0.02 
M Tris HC1, pH 7.5, 10 mM EDTA, 20% sucrose, and incubating for 10 minutes on 
ice. The cells are then pelleted by centrifugation at 4,000 x g for 15 min at 4°G. 
After the supernatant is removed, the cell pellets are resuspended in 10 mL of Buffer 
Al (0.01 M Tris HC1, pH 7.5, 1 mM EDTA, 14 mM P-mercaptoethanol - "(ME"), 
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and incubated on ice for 10 minutes. The cells are again pelleted at 4,000 x g for 15 
minutes at 4°G. After removal of the clear supernatant (periplasmic fraction I), the 
cell pellets are resuspended in Buffer Al, incubated on ice for 10 minutes, and again 
centrifiiged at 4,000 x g for 15 minutes at 4°G* The clear supernatant (periplasmic 
fraction II) is removed, and the cell pellet resuspended in 5 mL of Buffer T2 (0.02 M 
Tris HC1, pH 7.5, 14 mM pME, 1 mM EDTA, 1 mM PMSF). In order to disrupt the 
cells, the suspension (5 mL) and 7.5 mL of Dyno-mill lead-free acid washed glass 
beads (0.10-0.15 mm diameter) (available from Glen-Mills, Inc.) are placed in a 
Falcon tube and vortexed at top speed for two minutes, followed by cooling for at 
least 2 min on ice. The vortexing-cooling procedure is repeated another four times. 
After vortexing, the slurry is filtered through a sintered glass funnel using low suction, 
the glass beads washed twice with Buffer A2, and the filtrate and washes combined. 

The insoluble fraction of the crude extract is collected by centrifugation at 
20,000 x g for 15 min at 4°C, washed twice with 10 mL Buffer A2, and resuspended 
in 5 mL of MBLLI-Q water. 

A fraction containing the HCV protease is isolated from the insoluble material 
by adding to the suspension NaOH (2 M) and NaCl (2 M) to yield a final concentation 
of 20 mM each, vortexing the mixture for 1 minute, centrifuging it 20,000 x g for 20 
min at 4°C, and retaining the supernatant. 

The partially purified protease is then purified by SDS-PAGE. The protease 
may be identified by western blot, and the band excised from the gel. The protease is 
then eluted from the band, and analyzed to confirm its amino acid sequence. N- 
terminal sequences may be analyzed using an automated amino acid sequencer, while 
C-terminal sequences may be analyzed by automated amino acid sequencing of a 
series of tryptic fragments. 
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Example 7 
(Preparation of Yeast Expression Vector) 
(A) P650 (SOD/Protease Fusion) 

This vector contains HCV sequence, which includes the wild-type full-length 
HCV protease coding sequence, fused at the 5* end to a SOD coding sequence. Two 
fragments, a 441 bp EcoRI/BgUI fragment from clone lib and a 1471 bp BglH/EcoRI 
fragment from expression vector P500, were used to reconstruct a wild-type, full- 
length HCV protease coding sequence. These two fragments were ligated together 
with an EcoRI digested pS356 vector to produce an expression cassette. The 
expression cassette encodes the ADH2/GAPDH hybrid yeast promoter, human SOD, 
the HCV protease, and a GAPDH transcription terminator. The resulting vector was 
digested with BamHI and a 4052 bp fragment was isolated. This fragment was ligated 
to the BamHI digested pAB24 vector to produce p650. p650 expresses a polyprotein 
containing, from its amino terminal end, amino acids 1-154 of hSOD, an 
oligopeptide -Asn-Leu-Gly-Ile-Arg- , and amino acids 819 to 1458 of HCV-1, as 
shown in Figure 17 of WO 90/11089, published 4 October 1990. 

Clone lib was isolated from the genomic library of HCV cDNA, ATCC 
accession no. 40394, as described above in Example 3A, using a hybridization probe 
having the following sequence: 

5* CAC CTA TGT TTA TAA CCA TCT CAC TCC TCT 3*. 
This procedure is also described in EPO Pub. No. 318 216, Example IV.A.17. 

The vector pS3EF, which is a pBR322 derivative, contains the ADH2/GAPDH 
hybrid yeast promoter upstream of the human superoxide dimutase gene, an adaptor, 
and a downstream yeast effective transcription terminator. A similar expression vector 
containing these control elements and the superoxide dismutase gene is described in 
Cousens et al. (1987) Gene 61: 265, and in copending application EPO 196,056, 
published October 1, 1986. pS3EF, however, differs from that in Cousens et al. in 
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that the heterologous proinsulin gene and the immunoglobulin hinge are deleted, and 
Gln 154 of SOD is followed by an 

adaptor sequence which contains an EcoRI site. The sequence of the adaptor is: 

5' AAT TTG GGA ATT CCA TAA TTA ATT AAG 3' 

3' AC CCT TAA GGT ATT AAT TAA TTC AGCT 5' 

The EcoRI site facilitates the insertion of heterologous sequences. Once inserted into 
pS3EF, a SOD fusion is expressed which contains an oligopeptide that links SOD to 
the heterologous sequences. pS3EF is exactly the same as pS356 except that pS356 
contains a different adaptor. The sequence of the adaptor is shown below: 

5' AAT TTG GGA ATT CCA TAA TGA G 3' 

3' AC CCT TAA GGT ATT ACT CAG CT 5' 

pS356, ATCC accession no. 67683, is deposited as set forth below. 

Plasmid pAB24 is a yeast shuttle vector, which contains pBR322 
sequences, the complete 2p sequence for DNA replication in yeast (Broach (1981) in: 
Molecular Biology of the Yeast Saccharomyces, Vol. 1, p. 445, Cold spring Harbor 
Press.) and the yeast LEU 2 * 3 gene derived from plasmid pCl/1, described in EPO Pub. 
No. 116 201. Plasmid pAB24 was constructed by digesting YEp24 with EcoRI and 
re-ligating the vector to remove the partial 2 micron sequences. The resulting plasmid, 
YEp24deltaRI, was linearized with Clal and ligated with the complete 2 micron 
plasmid which had been linearized with Clal. The resulting plasmid, pCBou, was then 
digested with Xbal, and the 8605 bp vector fragment was gel isolated. This isolated 
Xbal fragment was ligated with a 4460 bp Xbal fragment containing the LEU 241 gene 
isolated from pCl/1; the orientation of LEU 24 gene is in the same direction as the 
URA3 gene. 

S. cerevisae, 2150-2-3 (pAB24-GAP-env2), accession no. 20827, is 
deposited with the American Type Culture Collection as set forth below. The plasmid 
pAB24-GAP-env2 can be recovered from the yeast cells by known techniques. The 
GAP-env2 expression cassette can be removed by digesting pAB24-GAP-env2 with 
BamHI. pAB24 is recovered by religating the vector without the BamHI insert. 
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Example 8 

5 (Yeast Expression of SOD-Protease Fusion Protein) 

' p650 was transformed in 5. cerevisae strain JSC310, Mata, leu2, ura3- 

52, prbl-1122, pep4-3, prcl-407, cir°: DM15 (g418 resistance). The transformation is 
5 as described by Hinnen et al. (1978) Proc Natl Acad Sci USA 75: 1929. The 
transformed cells were selected on ura- plates with 8% glucose. The plates were 
incubated at 30°C for 4-5 days. The tranformants were further selected on leu- plates 
with 8% glucose putatively for high numbers of the p650 plasmid. Colonies from the 
leu- plates were inoculated into leu- medium with 3% glucose. These cultures were 
10 shaken at 30°C for 2 days and then diluted 1/20 into YEPD medium with 2% glucose 
and shaken for 2 more days at 30°C 

& cerevisae JSC310 contains DM15 DNA, described in EPO Pub. No. 
340 986, published 8 NOvember 1989. This DM15 DNA enhances ADH2 regulated 
expression of heterologous proteins. pDM15, accession no. 40453, is deposited with 
15 the American Type Culture Collection as set forth below. 

: Example 9 

(Yeast Ubiquitin Expression of Mature HCV Protease) 
Mature HCV protease is prepared by cleaving vector 
20 C7fC20cC300C200 with EcoRI to obtain a 2 Kb coding sequence, and inserting the 
sequence with the appropriate linkers into a ubiquitin expression vector, such as that 
described in WO 88/02406, published 7 April 1988, or USSN 7/390,599 filed 7 
August 1989, incorporated herein by reference. Mature HCV protease is recovered 
upon expression of the vector in suitable hosts, particularly yeast. Specifically, the 
25 yeast expression protocol described in Example 8 is used to express a ubiquitin/HCV 
protease vector. 
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Example 10 
(Preparation of an In- Vitro Expression Vector) 

(A) pGEM®-3ZTrellow Fever Leader Vector 

Four synthetic DNA fragments were annealed and ligated** together to 
5 create a EcoRl/SacI Yellow Fever leader, which was ligated to a EcoRI/SacI digested 
pGEM®-3Z vector from Promega®. The sequence of the four fragments are listed 
below: 
YFK-1: 

5' AAT TCG TAA ATC CTG TGT GCT AAT TGA GGT GCA TTG GTC TGC 
10 AAA TCG AGT TGC TAG GCA ATA AAC ACA TT 3' 
YFK-2: 

5' TAT TGC CTA GCA ACT CGA TTT GCA GAC CAA TGC ACC TCA ATT 

AGC ACA CAG GAT TTA CG 3' 

YFK-3: 

15 5' TGG ATT AAT TTT AAT CGT TCG TTG AGC GAT TAG CAG AGA ACT 
GAC CAG AAC ATG TCT GAG CT 3' 
YFK-4: 

5' CAG ACA TGT TCT GGT CAG TTC TCT GCT AAT CGC TCA ACG AAC 
GAT TAA AAT TAA TCC AAA TGT GTT 3 
20 For in-vitro translation of the HCV protease, the new pGEM®- 

3Z/Yellow Fever leader vector was digested with BamHI and blunted with Klenow. 

(B) PvuII Construct from p6000 

A clone p6000 was constructed from sequences available from the 
genomic library of HCV cDNA, ATCC accession no. 40394. The HCV encoding 
25 DNA sequence of p6000 is identical to nucleotide -275 to nucleotide 6372 of Figure 
17 of WO 90/11089, published 4 October 1990. p6000 was digested with PvuII, and 
from the digest, a 2,864 bp fragment was isolated. This 2,864 bp fragment was 
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ligated to the prepared pGEM®-3Z/Yellow Fever leader vector fragment, described 
above. 

Example 11 
(In- Vitro Expression of HCV Protease) 

(A) Transcription 

The pGEM®-3Z/Yellow Fever leader/PvuII vector was linearized with 
Xbal and transcribed using the materials and protocols from Promega's Riboprobe® 
Gemini n Core system. 

(B) Translation 

The RNA produced by the above protocol was translated using 
Promega's rabbit reticulocyte lysate, minus methionine, canine pancreatic microsomal 
membranes, as well as, other necessary materials and instructions from Promega. 



Deposited Biological Materials: 

The following materials were deposited with the American Type Culture 
Collection (ATCC), 12301 Parklawn Dr., Rockville, Maryland: 



Name 

E. coli D1210, cflSODp600 



Deposit Date 
23 Mar 1990 



Accession No. 
68275 



Cf 1/5-1-1 in E. coli D1210 



11 May 1989 



67967 



Bacteriophage A.-gtl 1 cDNA 
library 



01 Dec 1987 



40394 



E. coli HB101, pS356 
plasmid DNA, pDM15 



29 Apr 1988 
05 May 1988 



67683 



40453 



S. cerevisae, 2150-2-3 
(pAB24-GAP-env2) 



23 Dec 1986 



20827 
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The above materials have been deposited with the ATCC under the accession 
numbers indicated. These deposits will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for purposes 
of Patent Procedure. These deposits are provided as a convenience to those of skill in 
the art, and are not an admission that a deposit is required under 35 U.S.C. §112. The 
polynucleotide sequences contained in the deposited materials, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by 
reference and are controlling in the event of any conflict with the sequences described 
herein. A license may be required to make, use or sell the deposited materials, and no 
such license is granted hereby. 



